Information Gain Ratio meets Maximal Marginal Relevance - A method of Summarization for Multiple Documents
نویسندگان
چکیده
In this paper, we propose a method to make a summary from multiple documents with taking account of comprehensibility and readability. As for comprehensibility, we show an integration of MMR into the termweighting method based on IGR. As for readability, we propose a method to generate a summary based on clustering important sentences according to subtopics and making a keyword list as a very brief summary for each cluster. By the evaluation in NTCIR3 TSC2, we show that the proposed method works well to generate comprehensive summaries when the length of summary is short and the target is a small (7 or less) number of documents.
منابع مشابه
Summarization: (1) Using MMR for Diversity- Based Reranking and (2) Evaluating Summaries
This paper 1 develops a method for combining queryrelevance with information-novelty in the context of text retrieval and summarization. The Maximal Marginal Relevance (MMR) criterion strives to reduce redundancy while maintaining query relevance in reranking retrieved documents and in selecting appropriate passages for text summarization. Preliminary results indicate some benefits for MMR dive...
متن کاملQuery-Focused Multidocument Summarization Based on Hybrid Relevance Analysis and Surface Feature Salience
Query-focused multidocument summarization is to synthesize from a set of topic-related documents a brief, well-organized, fluent summary for the purpose of answering an information need that cannot be met by just stating a name, date, quantity, etc. In this paper, the task is essentially treated as a sentence retrieval task. We propose a hybrid relevance analysis to evaluate the relevance of a ...
متن کاملSummarization of Multiple
In this era, where electronic text information is exponentially growing and where time is a critical resource, it has become virtually impossible for any user to browse or read large numbers of individual documents. It is therefore important to explore methods of allowing users to locate and browse information quickly within collections of documents. Automatic text summarization of multiple doc...
متن کاملData Summarization Using Maximal Marginal Relevance Method
The search for interesting information in a huge data collection is a tough job frustrating the seekers for that information. The automatic text summarization has come to facilitate such searching process. Automatic text summarization is to compress an original document into an abridged version by extracting almost all of the essential concepts with text mining techniques. The selection of dist...
متن کاملProject Underline - A Government Perspective
The purpose of the TIPSTER contract with Carnegie Group, Inc. (CGI) of Pittsburgh, PA is to promote and further develop automatic Text Summarization using a Maximal Marginal Relevance (MMR) metric to generate summaries of documents that are directly relevant to the information need of an individual user. CGI subcontracts with Carnegie Mellon University to perform most of its linguistic research.
متن کامل